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"Expression Control Polynucleotides d erived from 
Spliceosomal Protein Gene Pro moters" 

This invention relates to expression control 
polynucleotides derived from spliceosomal protein gene 
promoters . 

It is already well known that organisms such as plants 
or animals with novel characteristics can be produced 
by introducing genes or DNA sequences from the same or 
a related organism or by introducing genes or DNA 
sequences from other organisms. The key to production 
of novel phenotypes is the active expression of at 
least a part of the introduced DNA sequence. Generally 
expression of the introduced DNA sequence will occur 
only in the presence of an expression control 
polynucleotide, such as a promoter, which is compatible 
with the host organism. Promoters are nucleic acid 
sequences which are currently believed to regulate the 
expression of a gene by facilitating the binding of 
proteins required for transcription, such as RNA 
polymerase, to a portion of the nucleic acid sequence 
upstream of the gene- Thus for protein coding genes 
the DNA is transcribed into mRNA (messenger RNA) which 
is then relocated to the cytoplasm where it is 
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2 

1 available for translation into polypeptide. 
2 

3 Whereas promoters are generally disposed upstream of 

4 the genes they regulate and are thought to act by 
5. providing a binding site for an RNA polymerase 

6 transcription complex, another form of expression 

7 control polynucleotide known as an "enhancer" sequence 

8 can control the expression of a gene usually without 

9 regard to its relative position or orientation. 
10 

11 Promoters can vary in levels of expression induction 

12 and in their expression patterns. Some promoters are 

13 active in a tissue-specific or developmental stage- 

14 specific manner while other promoters are active 

15 constantly to continually drive expression of the gene 

16 they control. Such constantly active promoters are 

17 called constitutive promoters. The promoters of this 

18 latter class most widely used in genetic engineering 

19 are the Cauliflower Mosaic Virus (CaMV) 35S RNA 

20 promoter, nopaline synthetase (nos) or octopine 

21 synthetase (ocs) promoters. Constitutive promoters 

22 induce gene expression at a relatively constant rate. 
23 

24 Although the above conventional promoters generally 

25 drive expression at high levels, many have the 

26 disadvantage that they are derived from plant 

27 infectious agents; the CaMV 3 5S RNA promoter is derived 

28 from a plant virus and the other promoters mentioned 

29 are derived from strains of AgrroJbacterium, soil-borne 

30 infectious bacteria. The source of these promoters is 

31 a cause of concern in transgenic plant production. In 

32 addition, detailed analysis of expression patterns of 

33 CaMV 3 5S promoter, have shown that its levels of 

34 expression can vary greatly among different plant 

35 tissues to the level where it is inactive in some 

36 tissues and is therefore, no longer constitutive for 
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1 such tissue. For many biotechnological objectives, 

2 constitutive expression in all cells and tissues would 

3 be of great advantage. 
4 

5 - Splicesomal proteins are believed to be present in 

6 virtually all eukaryotic cells and are involved in the 

7 phenomenon of pre-mRNA splicing which removes introns 

8 (non-coding regions) from RNA transcripts before 

9 protein production. 
10 

11 The present invention seeks to provide a promoter (or 

12 other expression control polynucleotide) which is not 

13 derived from an infectious agent and which is suitable 

14 for use in the control of expression of recombinant 

15 genes in the construction of transgenic organisms such 

16 as plants and animals. 
17 

18 The present invention also seeks to provide a promoter 

19 (or other expression control polynucleotide) which is 

20 likely to be active throughout all or most cells of the 

21 organism. 

22 

23 According to the present invention there is provided an 

24 expression control polynucleotide at least partially 

25 derived from a spliceosomal protein gene promoter. 
26 

27 The expression control polynucleotide of the invention 

28 is capable of controlling the expression of a second 

29 polynucleotide (preferably comprising a polypeptide- 
3 0 encoding sequence) operably linked thereto* 

31 

32 RNA sequences which do not code for protein can also be 

3 3 expressed eg ribozymes or anti-sense RNA. 

34 

3 5 The term "expression control polynucleotide" as used 

3 6 herein will include promoters, enhancers or any other 
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1 functional equivalents or any other sequence elements 

2 which affect expression of other gene sequences. 
3 

4 The term "polypeptide-encoding sequence" as applied 

5 herein to polynucleotides means a polynucleotide 

6 comprising a sequence which can be transcribed into 

7 mRNA, which itself can be translated into a 

8 polypeptide. The "polypeptide-encoding sequence" may 

9 include non-translated portions, such as introns. 
10 

11 The spliceosomal protein gene promoter may be derived 

12 from plants. The plants may be dicotyledonous (eg 

13 potatoes) , or monocotyledonous (eg maize) * 
14 

15 The present invention also provides a recombinant 

16 polynucleotide comprising an expression control 

17 polynucleotide according to the invention operably 

18 linked to a second polynucleotide (preferably 

19 comprising a polypeptide-encoding sequence) . 
20 

21 The present invention also provides a recombinant 

22 vector containing an expression control polynucleotide 

23 or a recombinant polynucleotide as defined above. 
24 

25 According to the present invention there is also 

26 provided a method of producing a recombinant vector, 

27 said method comprising ligating an expression control 

28 polynucleotide into a vector or part thereof. A method 

29 of producing a transformed cell by transfecting a host 

3 0 cell using said recombinant vector forms another aspect 
31 of the invention. 

32 

33 The present invention also provides a transformed host 

34 cell containing a recombinant polynucleotide or vector 
3 5 as defined above. 

36 
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1 The present invention also provides a transgenic 

2 organism (for example a transgenic plant) containing a 

3 recombinant polynucleotide or vector as defined above. 

4 The progeny (and seeds) of such transgenic organisms 

5 forms a further part of the invention. 

6 

7 The present invention also provides a method for 

8 controlling the expression of a polypeptide from a 

9 nucleotide sequence encoding the polypeptide, said 

10 method comprising operably linking said sequence to an 

11 expression control polynucleotide of the invention. 

12 

13 The expression control polynucleotide of the invention 

14 may comprise double- or single-stranded DNA or RNA. 
15 

16 Three cultures of E.coll (SCRI/JB/1, SCRI/JB/2 and 

17 SCRI/JB/3), each containing a plasmid having an 

18 expression control polynucleotide according to the 

19 invention were deposited on 14 March 1994 with the 

20 National Collection of Type Cultures under numbers NCTC 

21 12864, NCTC 12865 and NCTC 12866 respectively. 
22 

23 Cultures SCRI/JB/1 and 2 contain dicotyledonous 

24 spliceosomal protein gene expression control 

25 polynucleotides (promoters for potato U1A and U2B" 

26 genes respectively) ; SCRI/JB/3 contains a 

27 monocotyledonous expression control polynucleotide 

28 (promoter for a maize PRP8 gene) . 
29 

30 Accordingly, the present invention also provides NCTC 

31 deposits Nos 12864, 12865 and 12866 and the plasmids 
3 2 thereof . 

33 

34 Recombinant DNA technology has been recognised as a 

35 powerful technique not only in research but also for 

36 commercial purposes. Thus, by using recombinant DNA 
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1 techniques (see Sambrobk et al 1982 and "Principles of 

2 Genetic Engineering", Old and Primrose, 5th edition, 

3 1994) exogenous genetic material can be transferred to 

4 a host cell and the polypeptide encoded by the 

5 exogenous genetic material may be replicated by and/or 

6 expressed within the host. For the purposes of 

7 simplicity recombinant DNA technology is normally 

8 carried out with prokaryotic micro-organisms, for 

9 example bacteria such as E. coli, as host. However, 

10 use has also been made of eukaryotic organisms, in 

11 particular yeasts or algae, and in certain applications 

12 eukaryotic cell cultures may also be used. 

13 

14 Genetic alterations to mammalian species by micro- 

15 injection of genes into the pro-nuclei of single-cell 

16 embryos is also well known and has been described by 

17 Brinster et al, in Cell 27: 223-231, 1981. Thus 

18 general techniques used in recombinant DNA technology 

19 and the production of transgenic organisms is within 

20 the scope of the skilled man. 
21 

22 Using a cDNA sequence (complimentary DNA - reverse 

23 transcribed from mRNA) , either full-length or partial, 

24 as a probe for a gene of interest, the gene promoter 

25 can be readily isolated by standard procedures. 

26 Briefly, the cDNA probe is first used to screen a 

27 genomic library to isolate a genomic clone containing 

28 the promoter and coding sequence. Restriction mapping 

29 and Southern blotting with the cDNA as probe delineates 

30 the region of the genomic clone containing the coding 

31 sequence. Sequencing of this region of the genomic 

32 clone and comparison to the cDNA clone will identify 
3 3 the translation initiation ATG codon, and if the cDNA 

34 is full-length, will give an indication of the 

35 transcription start site, upstream of which lies the 

3 6 gene promoter. Important promoter sequence elements may 
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1 lie in excess of around 2 kbp of the transcription 

2 start site. Therefore, a genomic fragment of preferably 

3 1-5 kbp is isolated for initial testing of promoter 

4 activity. For example, in the case of U1A and U2B" , a 

5 monoclonal antibody (mAb) , 4G3 , raised against a b- 

6 galactosidase-human U2B" protein , was used as a probe 

7 to screen a potato cDNA expression library and a full- 

8 length cDNA clone was isolated (Simpson et al . 1991) . 

9 Screening of a potato genomic library using this cDNA 

10 clone resulted in a genomic clone containing part of 

11 the gene and around 15 kbp upstream sequences which 

12 were used to clone the promoter in subsequent 

13 experiments. The extreme similarity between U1A and 

14 U2B" enabled us to isolate a full-length genomic clone 

15 of the U1A gene. Since the latter clone contained 

16 around 7 kbp of upstream sequences, it was possible to 

17 clone the promoter from the upstream region. 
18 

19 The use of spliceosomal protein gene promoters to drive 

20 expression of DNA sequences in transgenic plants or 

21 animals has the advantage that the expression is 

22 constitutive, expressed in all cell and tissue types 

23 and uses naturally occurring plant or animal nucleic 

24 acid sequences which are not derived from infectious 

25 agents, 
26 

27 While further modifications and improvements may be 

28 made without departing from the scope of this 

29 invention, the following is a description of one or 

3 0 more examples of the invention, with reference to the 

31 accompanying drawings, in which: - 

32 

33 Fig. 1 shows a schematic diagram of a potato U2B" 

34 genomic clone present in the cells of culture No 
3 5 SCRI/JB/2; 

26 Fig. 2 shows a schematic diagram of a potato U1A 
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genomic clone present in the cells of culture No 
SCRI/JB/l; 

Fig. 3 shows a schematic diagram of a maize PRP8 
genomic clone present in the cells of culture No 
SCRI/JB/3 ; 

Fig. 4 shows a schematic diagram of a UIA/GUS 
expression cassette; 

Fig. 5 shows a schematic diagram of a U2B M /GUS 
expression cassette; 

Fig. 6 shows a graph of fluorescence against time 
plotted from data collected from a fluor ©metric 
assessment of GUS gene expression controlled by 
potato spliceosomal protein gene promoters; 
Fig. 7 shows a schematic diagram of a UIA/GUS 
expression cassette; 

Fig. 8 shows a schematic diagram of U2B n /GUS 
expression cassette; 

Fig. 9 shows a schematic diagram of the genomic 
organisation of the potato U1A gene; and 
Fig. 10 shows a schematic diagram of the genomic 
organisation of the potato U2B" gene. 

Example 1 

Genes for two plant spliceosomal proteins from the 
dicotyledonous plant, potato (Solanum tuberosum) were 
isolated encoding the spliceosomal proteins U1A and 
U2B". 

The promoter of a potato U2B" gene was isolated from a 
potato genomic library in X EMBL 3 by conventional 
methods. The library was screened with a potato U2B" 
cDNA clone (Simpson et al., 1991) using standard 
procedures (Sambrook et al., 1989). The potato U2B" 
genomic clone was plaque-purified, DNA prepared, 
fragments subcloned into plasmid vectors and DNA 
sequenced by standard procedures (Sambrook et al. 
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1 1989) . 
2 

3 The genomic clone contained an insert of 15 kilobase 

4 pairs (kbp) (Fig. 1) from which relevant sub-fragments 

5 . were cloned into plasmid vectors (Fig. 1) . The clone 

6 only contained a fragment of coding region of U2B" (100 

7 bp), a fragment of the first intron in U2B" (100 bp) 

8 and approximately 15 kbp of upstream sequences 

9 containing the U2B" promoter. 
10 

11 Example 2 

12 The promoter of a potato U1A gene was isolated from a 

13 ' potato genomic library in X EMBL 4. The library was 

14 screened with a potato U2B" cDNA clone (Simpson et al. , 

15 1991) because U1A and U2B" are closely related, and a 

16 genomic clone was obtained by known methods. The 

17 genomic clone was characterised by state-of-the-art 

18 methodologies as described by Sambrook et al., (1989). 

19 The clone contained an insert of 15 kbp (Fig. 2) from 

20 which relevant sub-fragments were cloned into plasmid 

21 vectors by standard techniques (Sambrook et al., 1989). 

22 The clone contained the whole of the U1A coding 

23 sequence on five exons, four introns and 7 and 2 kbp of 

24 5' and 3' flanking sequence respectively. 
25 

26 The promoter regions can be linked to marker genes such 

27 as bacterial ^-glucuronidase (Jefferson, 1987) by 

28 standard molecular techniques (Sambrook et al., 1989). 

29 Promoter constructs can be analyzed by introduction 

3 0 into plant cells by known methodology such as chemical 
31 or electrical transf ection, microinjection, biolistics 
3 2 or Agrobacterium-mediated or other vector-mediated 

33 transformation (see Shaw, 1988). Transgenic plants 

34 containing the construct can be analyzed by detecting 

35 the presence of GUS enzyme and thereby its expression 

36 using known methods of histochemical staining 
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1 (Jefferson, 1987) . Levels of expression of marker 

2 genes driven by the promoters in either stably 

3 transformed plants or transiently transformed plant 

4 cells or protoplasts can be assessed by comparison with 

5 levels of endogenous gene expression and of marker gene 

6 expression driven by the Cauliflower Mosaic Virus 35S 

7 RNA promoter. This analysis can use known, state-of- 

8 the-art methodologies to detect RNA transcripts 

9 (Sambrook et aL, 1989; Simpson et al., 1992) and to 

10 detect production of enzyme from marker genes, for 

11 example, for GUS marker gene activity. 
12 

13 The promoter region can be incorporated into plasmid 

14 vectors designed for general use in construct 

15 production in E.coli, and for use in stable, 

16 Agrobacteriuin-mediated transformation and in transient 

17 transformation or stable, physical transformation 

18 methods. DNA sequences to be expressed in the 

19 transgenic plant can be inserted behind the promoter 

20 regions as is currently commonly performed using the 

21 CaMV 3 5S RNA promoter (see Shaw, 1988) prior to 

22 introduction into plant cells or production of 

23 transgenic plants. 
24 

25 Example 3 

26 A gene for a plant spliceosomal protein was cloned from 

27 the monocotyledonous plant, maize (Zea mays L.)/ which 

28 encodes the spliceosomal protein PRP8 (Jackson et al., 

29 1988) by virtue of sequence homology to PRP8 of yeast. 
30 

31 The genomic clone of maize PRP8 was isolated from a 

32 maize genomic library constructed in X EMBL 4 by 

33 conventional methods. The library was screened with a 

34 fragment of maize PRP8 generated by state-of-the-art 

35 methodologies, such as polymerase chain reaction (PCR) 
3 6 amplification using oligonucleotide sequences designed 
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1 from the yeast and Caenorhabditis elegans PRP8 DNA 

2 sequences, cloning and sequencing. The maize PRP8 

3 genomic clone was plaque purified, DNA prepared and 

4 fragments subcloned into plasmid vectors by standard 

5 procedures . 
6 

7 The PRP8 promoter region can be incorporated into 

8 plasmid vectors as previously described, 
9 

10 Example 4 

11 Promoter/ GUS Constructions 

12 Two expression cassettes containing the jS-glucuronidase 

13 (GUS) gene driven by the potato U1A and U2B" 

14 splicesomal protein gene promoters respectively were 

15 constructed. Both cassettes contained the nopaline 

16 synthase poly A (NOS-ter) downstream to the GUS gene. 

17 The construction of these expression cassettes was 

18 achieved by replacing the CaMV 3 5S promoter in the 

19 vector pBI221 by upstream sequences of the potato U1A 

20 and U2B" spliceosoraal protein genes. Genomic clones 

21 described above were used. 
22 

23 UIA 

24 A 4.5 kb PstJ fragment containing about 2.5 kb upstream 

25 to the UIA gene, the first and the second exons f the 

26 first intron and part of the second intron was used to 

27 clone the UIA promoter. In order to clone the 

28 promoter, a site specific mutation was generated which 

29 changed the sequence ATGGCG at the translation start 

30 site into TCTAGA to introduce an Xbal restriction site. 

31 Subsequently, the entire 2.5 kb upstream region was 

32 cloned into pBI221 after eliminating the 35S promoter 

33 using the restriction enzymes PstI and Xbal (fig- 4) . 
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1 U2B" 

2 An EcoRI/Sa.11 fragment containing about 2 kb upstream 

3 to the U2B" gene, the entire first exon, and part of 

4 the first intron was used to clone the U2B n promoter, 

5 Appropriate restriction sites were introduced into the 

6 upstream region using PCR amplification. The first PCR 

7 primer started a few nucleotides downstream of the 

8 EcoRI site and contained additional nucleotides in its 

9 5' end to provide PstI and Bglll sites. The second 

10 primer started two nucleotides upstream of the ATG 

11 start codon and contained additional nucleotides 

12 providing a site for cleavage by BamHI at its 5' end. 

13 After PCR amplification and digestion with BamHI and 

14 PstI, the 2 kb upstream region was cloned into pBI221 

15 replacing the 35S promoter (fig. 5) . 
16 

17 Protoplast Isolation 

18 The constructed cassettes were tested for promoter 

19 activity in transient gene expression assays using 

20 tobacco protoplasts. Protoplasts were prepared from 

21 young, fully expanded tobacco leaves. Leaves were 

22 placed in 7% Domestos for 10 minutes, washed with 

23 sterile tap water, dried, and peeled to remove the 

24 lower epidermis. Peeled leaf pieces were placed onto 

25 15 ml enzyme solution in a sterile Petri dish [enzyme 

26 solution: 1 mg/ml cellulase, 0.5 mg/ml driselase, 0.2 

27 mg/ml macerase, suspended in TO" (see appendix) ] and 

28 incubated overnight in the dark at 25 °C. Protoplasts 

29 were transferred into two sterile 10 ml tubes through 

30 sterile sieves and spun at 400 rpm for 5 minutes. 

31 After resuspending each protoplast pellet in 10 ml TO", 

32 5 ml aliquots were each layered onto 2 . 5 ml 16% sucrose 

33 and spun at 800 rpm for 5 minutes. The purified 

34 protoplasts were resuspended in 10 ml TO", spun at 400 
3 5 rpm for 5 minutes, and recollected in 5 ml TO". 

36 



WO 95735386 PCT/GB95/01443 

13 

1 Protoplast Transf ection 

2 Transfection of the protoplasts with the U1A/GUS and 

3 U2B"/GUS constructs was achieved using plasmid DNA 

4 purified by Qiagen (Trade Mark) columns- For each 

5 experiment 200 Ml of the purified protoplasts were 

6 transfected with 30 /ig DNA dissolved in 20 pi distilled 

7 water. After dropwise addition of the DNA and careful 

8 homogenization of protoplasts, 200 ml of PEG solution 

9 was added. [PEG solution: 25% PEG 8000, 0.1M Ca(N0 3 ) 2 , 

10 0.45M Mannitol, 10 mM MES: 2- (N-Morpholino) 

11 ethanesulf onic acid] was added dropwise. The 

12 transfected protoplasts were then incubated at ambient 

13 temperature for 20 minutes; subsequently, 4 ml calcium 

14 nitrate was carefully added [0,275 M Ca(N0 3 ) 2 ]. After 

15 an incubation period of 20 minutes at ambient 

16 temperature protoplasts were spun at 4 00 rpm for 3 

17 minutes, suspended, and resuspended in 5 ml TO 4 *. The 

18 transfected protoplasts were then incubated in sealed 

19 Petri dishes in the light at 25 °C for 48 hours. 
20 

21 GUS Assays 

22 Samples were collected by spinning the protoplasts at 

23 400 rpm for 3 minutes, resuspended, transferred into 

24 Eppendorf tubes (1 ml each) , and spun again at 1000 rpm 

25 for 2 minutes in a cold microcentrifuge (4°C) . The 

26 pellet was then resuspended in 200 /xl extraction and 

27 reaction buffer (50 mM NaP0 4 pH 7.0, 10 mM EDTA, 0.1% 

28 Triton X-100, 10 mM 0 -mercaptoethanol) , spun at 1300 

29 rpm for 3-5 minutes, and the supernatant was stored in 

30 clean Eppendorf tubes. 50 pi sample extract was added 

31 to 220 pi extraction and reaction buffer in an 

32 eppendorf tube and preincubated at 37°C for 5 minutes. 

33 At 15 sec intervals, 30 /il 1 mM MUG (4 -methyl 

34 umbelliferyl glucuronide) substrate was added to each 
3 5 tube giving time to stop reactions at accurate 

3 6 intervals. At defined time points, 100 jlx! samples were 
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1 taken and added to 2.9 ml stop solution (0.2 M Na 2 C0 3 ) 

2 in a clean cuvette. At this stage, GUS activity could 

3 be inspected visually by transillumination using a long 

4 wave UV light box. Quantitative measurements of 

5 fluorescence were made using a fluorometer. 
6 

7 Protoplast transf ections and GUS assays have been 

8 performed to investigate the expression of the GUS gene 

9 driven by splicesomal protein gene promoters using the 

10 above described constructs. In all experiments, the 

11 vector pBI221 which contains the GUS gene driven by the 

12 CaMV 3 5S promoter and terminated by the nopaline 

13 synthase poly A was used as a positive control. 

14 Negative controls were a: extracts from protoplasts 

15 transf ected with water instead of plasmid DNA; and b: 

16 MUG substrate added to the extraction and reaction 

17 buffer without the addition of any extracts. Reactions 

18 were carried out for 3 0 minutes, 1 hour, 2 hours, 3 

19 hours, 4 hours, 5 hours and 20 hours. In order to 

20 assess the activity of the tested promoters in 

21 comparison to the 3 5S promoter, fluorometer readings 

22 were set at 100 for pBI221 at each time point so that 

23 the readings obtained for U1A and U2B" indicated the 

24 GUS activity in each case relative to that obtained for 

25 the 3 5S promoter. The results which are illustrated in 

26 fig 6 show that the U2B" promoter has about 70% of the 

27 activity of the 3 5S promoter whereas the U1A promoter 

28 gave rise to 10-15% of the activity observed for the 

29 35S promoter. 
30 

31 Figs. 9 and 10 show the genomic organisation of the 

32 original genomic clones made of potato genes U1A and 

33 U2B". The promoter of the U1A gene is believed to lie 

34 in the region between the ATG initiation codon and the 
3 5 PstI site 2.5 kb upstream of the coding sequence 

3 6 initiating at the ATG start codon. The promoter of the 



WO 95/35386 PCT/GB95/01443 

15 

1 U2B" gene is believed to lie in the region between the 

2 ATG coding region and the EcoRI site 2.0 Jcb upstream of 

3 the coding sequence initiated by the ATG start codon. 
4 

5 Figs. 9 and 10 indicate regions of the genomic clones 

6 containing the promoters of U1A and U2B" which have 

7 been sequenced. The sequences of those regions are 

8 shown in the sequence listing. 
9 

10 The partial U1A promoter sequence is shown on the 

11 sequence listing as SEQ:ID:Nol. Sequence data is 

12 presented for the 3 ' end of the region between the ATG 

13 codon and a Pstl site 2.5kb upstream. About l.7kb of 

14 the 5' end of this region has not been sequenced. The 

15 partial sequence of the U2B" genomic clone is shown in 

16 the sequence listing as three separate sequences since 

17 incomplete sequence data is available for this 

18 promoter. The first of the U2B" sequences shown is 

19 SEQ:ID:No2 which corresponds to the region from the 5' 

20 end TCT to a TAA 119 bases downstream. After the 3 ' 

21 end of SEQ:ID:No2 there is a region of approximately 

22 900 bases which have not been sequenced. After the 3' 

23 end of the 900bp unidentified region, the next U2B" 

24 sequence portion available is SEQ:ID:No3 which runs 

25 from a TCT approximately 900 bases downstream of the 

26 end of SEQ:ID:No2 to an ATC a further 13 9 bases 

27 downstream. After the 3' end of SEQ:ID:No3 there is a 

28 region of 11 bases which have not been identified. 

29 This unidentified region is followed by the last 

30 sequenced portion of the U2B" clone which is designated 

31 as SEQ:ID:No4 which starts at ATA at its 5' end and 

32 ends at the ATG codon (bases 729-731) initiating 

33 transcription of the coding region of U2B M (not shown) . 
34 

35 Spliceosomal protein gene promoters (or other 

36 expression control polynucleotides) have the advantages 
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1 that (a) spliceosomal proteins are absolutely required 

2 and thus spliceosomal protein gene promoters are likely 

3 to be active in every cell and tissue type; (b) they 

4 are not derived from infectious agents which overcomes 

5 objections to the use of such sequences due to 

6 potential recombination; and (c) different genes or DNA 

7 sequences are likely to be expressed at different 

8 levels reflecting the relative abundance of the 

9 different spliceosomal proteins. 
10 

11 Modifications and improvements may incorporated without 

12 departing from the scope of the invention. For 

13 example, an expression control polynucleotide in 

14 accordance with the invention may be operably linked to 

15 a second polynucleotide which has some function other 

16 than coding for a polypeptide. One such example might 

17 be to operably link an expression control 

18 polynucleotide to a gene encoding a ribozyme or anti- 

19 sense RNA. Indeed it will be realised by the skilled 

20 man that the function of the gene under the control of 

21 the expression control polynucleotide of the invention 

22 is not important, and that the expression of many 

23 diverse genes can be controlled. 
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* 

1 Appendix 1 

2 TO-: TO+: 

3 4 ml solution 1 4 ml solution 1 

4 200 ml solution 3 4 ml solution 2 

5 200 ml solution 4 200 ml solution 3 

6 200 ml NAA (3 mg/ml) 200 ml solution 4 

7 200 ml BAP (1 mg/ml) 200 ml NAA (3 mg/ml) 

8 16 g Mannitol 200 ml BAP (lmg/ml) 

9 pH 5.5 adjusted with NaOH 16 g Mannitol 

10 filtersterilised +2 00 ml 4 g Sucrose 

11 cefotaxime (100 mg/ml) 40 ml Tween 20 

12 pH 5.5 adjusted with NaOH 

13 filtersterilised +200 ml 

14 cefotaxime (100 mg/ml) 

15 
16 

17 Solution l: Solution 2: 

18 10.3 0 mM NH 4 N0 3 100 MM FeS0 4 

19 9.40 mM KN0 3 100 /liM Na 2 EDTA 

20 1.50 mM CaCl 2 2H 2 0 

21 0.75 mM MgS0 4 .7H 2 0 

22 0.62 mM KH 2 P0 4 
23 

24 Solution 3: Solution 4: 

25 16.00 MM H3BO3 555.00 MM Inostol 

26 0.60 /iM MnS0 4 .H 2 0 3.00 mM Thiamine 

27 3.50 MM ZnS0 4 .7H 2 0 5.00 MM Pyridoxine 

28 0.12 MM CuS0 4 .5H 2 0 8.00 MM Niacin 

29 0.22 MM AICI3 2.00 MM Pantothenate 

30 0.13 MM NiCl 3 .6H 2 0 0-04 mM Biotine 

31 0.06 MM KI + 1 mg/Ml NaOH 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT :. 

(A) NAME: Scottish Crop Research Institute 

(B) STREET: Invergowrie 

(C) CITY: Dundee 

(E) COUNTRY: United Kingdom 

{F) POSTAL CODE (ZIP): DD2 5DA 

(G) TELEPHONE: +44 1382 562731 

(H) TELEFAX: +44 1382 561442 

(ii) TITLE OF INVENTION: Expression control polynucleotides derived 
from spliceosomal protein gene promoters 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1,30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solarium tuberosum 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TTAGAATTAG AATCCCCATT TTTAAGAATA ATCCTAGATA ATTTTCTTAA ACATGACAAT 
TGATACCCAC AATTAATTAC TATTACATAA ATTTTTACCT AAATTAGATA TAACTTTCAA 
TTTCAAAAAT TAAAACCCAA AAAAATTGAA CGACAATACG AGAGGGGATC AAACATAGGC 



60 
120 
180 
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GAGCAATTAG AGAAATTGAC GGGTAGACAT CAACAAACCA TCAAGAATTT AAAAGCGGAA 240 

AGAGAAAAAA ATACACTATG GACGAATATT TTTATAGAAT TCAATATGTA AAACTAATAA 300 

ACAAGAAAGT AAATCATCTT TTATTCAAAG TAATGAAGAA GAAGAATTGA ATAAATATTT 360 

ACATAATCAA TAAAAAAAAC TCATTCAAAA GAATCGTGTG TATGGGAAAG AAGAAGAAAA 420 

AAAAGGCAGA AAAAAACCAC TTCCCAATAA AAAAGGACAT CATGCTGCCA CCTCCTAAAA 480 

TTATTTAATT TAATTAAAAA AAAAACTTCC CAACACGTGG GCTACTAATT GCAAAATTTA 540 

ATTTTTAAAA AGCTTTTTTT GTCAAGAAAA TAAAAGATGG CTATATGTTG CCAATTAGTA 600 

AAATGGGATG TCATGCTGTG TCATTTTTTC TTGAGTTGTT AAGGGCTCAA AGCCCAATTG 660 

TTTATCCAGC CCAAGCCCAA ATCGGAGCCC TATTCGTGCC CAAAAATTTC TGGAGAAATT 720 

AACGACAACT GAAGTTTCTA CCTCACCGGC GAAAGTTGCA GCTAGGCGTA GACGAGGAGC 780 

CATG 784 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCTCTCCAGC TCTTCCCTCC TAAAACAACC ATTTTATGAG TACAGACACA AACCAGCTTA 60 
GCAACCAGTA AATCCAAAAC TTTAATTCCA CGTGTAAGCG CTAACACTTC ACCCACTAA 119 
(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCTTTACACT ATTTTCAAAT AACGATGAGA CCTGTAATAA TGTAATAACT TGAAAATAGA 60 
ACAATAACTC ATTCAGTACA ACAAATAAAA TCATACTAAT GTATATTTTT AAAAACAATT 120 
TAACTCTATT TAATTAATC 139 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuberosum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATATCTAAAA AAATGTGATT GGAGCACTCA TTGACCACAC GAATGAATCT ACAATGTAGA 60 

TCATTCTATC TAGACAAATA ATGCAAAAAC TAAAAGATAA AAGTAATATT ATACTACAAT 120 

CTGTTAAATG ATATCAATAT TACAAAAGTT CTCTACGATG TCAATACATA TATTAAAATC 180 

TATTTGATTA ATCAGAAACA TATCATGTGT GAATTTTTTT AATTAAAGAT CCCTTTAATC 240 

ATCTGAATCA ACCTTGGCTG GTCTCACATC TTTCCACCCT CTACTCGGTC TTCACTTTCT 300 
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CTTAAACTAG GGAAGAACAA CATGATATTA GCTTAGATTA ATTAAACAAG CTCATCAAAA 360 

CTACCATCCA ATTTAAGCCA ATAATGTTTA AATAAAACAA AAAACAACGT ACTCATTTTT 420 

TCATAACAAG AAGTTAAAAT TTATATGAAT CCTTACTCCA AAAAAGAAGA AAAATTACAA 480 

TATCAATATA TATAACTACT CTATTTGGTT AGTCAACAAA ATGTTAGTAT ATGTATTGCA 540 

AGTTCGCAAC ACCCGCTTGG GCCTTGACCA CATATTTATA TGGGCCGGTT GTCAATTTAA 600 

GCCCACTTTG TGTTCGTTCG CCTTTCTTGT AGCTCCAAAC TCTTGGAAAT TTGTCGAGCA 660 

CATTCAGAAA TCACAGAGAA GAGCAAGTGA ATATACATAC AGATAGAGAA AAGCTG CTCT 720 

GCTCGGTAAT G 731 
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Claims 



3 



An expression control polynucleotide at least 
partially derived from a spliceosomal protein gene 
promoter. 

An expression control polynucleotide according to 
Claim 1 derived from a spliceosomal protein gene 
promoter of a plant. 

A recombinant polynucleotide comprising an 
expression control polynucleotide according to 
Claim 1 or Claim 2 operably linked to a second 
polynucleotide . 

A recombinant polynucleotide according to Claim 3 , 
wherein the second polynucleotide encodes a 
polypeptide . 

A recombinant polynucleotide according to Claim 3, 
wherein the second polynucleotide encodes a 
ribozyme. 

A recombinant polynucleotide according to Claim 3, 
wherein the second polynucleotide encodes anti- 
sense RNA, 

A recombinant vector containing an expression 
control polynucleotide according to Claim 1 or 
Claim 2, or containing a recombinant 
polynucleotide according to any one of Claims 3 to 

6. 

A method for producing a recombinant vector, said 
method comprising ligating an expression control 
polynucleotide according to Claim 1 or Claim 2 to 
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1 a vector. 

2 

3 9 A transformed host cell containing a recombinant 

4 polynucleotide according to any one of Claims 3 to 

5 6, or a recombinant vector as claimed in Claim 7. 
6 

7 10 A transgenic organism and/or the progeny and/or 

8 seeds thereof , containing a recombinant 

9 polynucleotide according to any one of Claims 3 to 
10 6, or a recombinant vector according to Claim 7. 
11 

12 11 A method for controlling the expression of a 

13 polypeptide from a polynucleotide encoding the 

14 polypeptide, said method comprising operably 

15 linking said polynucleotide to an expression 

16 control polynucleotide according to Claim 1 or 

17 Claim 2. 
18 

19 12 A spliceosomal protein gene promoter deposited as 

20 any one of NCTC 12864, NCTC 12865 and NCTC 12866. 
21 

22 13 A recombinant polynucleotide having a spliceosomal 

23 protein gene promoter according to Claim 12 . 

24 

25 14 A recombinant vector having a spliceosomal protein 

26 gene promoter according to Claim 12. 
27 

28 15 A transformed host cell containing a spliceosomal 

29 protein gene promoter as claimed in Claim 12, a 

3 0 recombinant polynucleotide as claimed in Claim 13 , 

31 and/or a recombinant vector as claimed in Claim 

32 14. 
33 

34 16 A spliceosomal protein gene promoter comprising a 

35 region of a plasmid in NCTC 12864 between an ATG 
3 6 codon at the start of the coding sequence and a 
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Pstl site 2.5 kb upstream of the ATG codon. 

A spliceosomal protein gene promoter comprising a 
region of a plasmid in NCTC 12865 between an ATG 
codon at the start of the coding sequence and an 
JEcoRI site 2 kb upstream of the ATG codon. 

A polynucleotide having any one of the sequences 
SEQ:ID:Nol, SEQ:ID:No2, SEQ:ID:No3, SEQ:ID:No4 and 
functional equivalents thereof. 
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